chore: bump version to 0.1.5 by ternaus · Pull Request #93 · albumentations-team/albucore

ternaus · 2026-04-07T05:10:13Z

- Extend reduce_sum to accept any NumPy dtype (float64, int64, bool, etc.), falling back to NumPy with an appropriate accumulator (float64 for floats, int64 for integers/bool) instead of raising ValueError.
- Update test_unsupported_dtype_raises: split it into a mean/std test and a new test_reduce_sum_accepts_any_dtype covering float64, int64, and bool.

Made-with: Cursor

Summary by Sourcery

Allow reduce_sum to operate on a wider range of NumPy dtypes while keeping optimized paths for uint8 and float32, and bump the package version.

New Features:

Support arbitrary NumPy dtypes in reduce_sum, falling back to NumPy with appropriate float64 or int64 accumulation.

Enhancements:

Adjust reduce_sum behavior to no longer raise on non-uint8/float32 dtypes when using global or per-channel reductions and instead route them through the generic NumPy path.

Build:

Bump project version from 0.1.4 to 0.1.5.

Tests:

Split unsupported-dtype test for mean/std from reduce_sum and add coverage ensuring reduce_sum works with various integer, float, and boolean dtypes.

sourcery-ai · 2026-04-07T05:10:22Z

Reviewer's Guide

Extends reduce_sum to handle any NumPy dtype by routing non-uint8/float32 arrays through a generic NumPy-based reduction with appropriate accumulator dtypes, updates tests to reflect the new behavior, and bumps the package version to 0.1.5.

Sequence diagram for extended reduce_sum dtype handling

sequenceDiagram
    actor User
    participant reduce_sum
    participant _reduce_sum_global_uint8
    participant _reduce_sum_global_float32
    participant _reduce_sum_per_channel_uint8
    participant _reduce_sum_per_channel_float32
    participant _reduce_sum_numpy
    participant numpy

    User->>reduce_sum: reduce_sum(arr, axis, keepdims)
    alt axis is None or global
        alt _is_uint8_image(arr)
            reduce_sum->>_reduce_sum_global_uint8: _reduce_sum_global_uint8(arr, keepdims)
            _reduce_sum_global_uint8-->>reduce_sum: uint64 or ndarray
        else _is_float32_image(arr)
            reduce_sum->>_reduce_sum_global_float32: _reduce_sum_global_float32(arr, keepdims)
            _reduce_sum_global_float32-->>reduce_sum: float32 or ndarray
        else other dtype
            reduce_sum->>_reduce_sum_numpy: _reduce_sum_numpy(arr, axes=None, keepdims)
            _reduce_sum_numpy->>numpy: sum(arr, axis=None, dtype=acc, keepdims)
            numpy-->>_reduce_sum_numpy: result
            _reduce_sum_numpy-->>reduce_sum: result
        end
    else axis equals _per_channel_spatial_axes(arr)
        alt _is_uint8_image(arr)
            reduce_sum->>_reduce_sum_per_channel_uint8: _reduce_sum_per_channel_uint8(arr, keepdims)
            _reduce_sum_per_channel_uint8-->>reduce_sum: uint64 or ndarray
        else _is_float32_image(arr)
            reduce_sum->>_reduce_sum_per_channel_float32: _reduce_sum_per_channel_float32(arr, axes, keepdims)
            _reduce_sum_per_channel_float32-->>reduce_sum: float32 or ndarray
        else other dtype
            reduce_sum->>_reduce_sum_numpy: _reduce_sum_numpy(arr, axes, keepdims)
            _reduce_sum_numpy->>numpy: sum(arr, axis=axes, dtype=acc, keepdims)
            numpy-->>_reduce_sum_numpy: result
            _reduce_sum_numpy-->>reduce_sum: result
        end
    else explicit axes
        reduce_sum->>_reduce_sum_numpy: _reduce_sum_numpy(arr, axes, keepdims)
        _reduce_sum_numpy->>numpy: sum(arr, axis=axes, dtype=acc, keepdims)
        numpy-->>_reduce_sum_numpy: result
        _reduce_sum_numpy-->>reduce_sum: result
    end
    reduce_sum-->>User: reduced result
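
The branching in the diagram can be summarised with a small, self-contained sketch. It is not albucore's implementation: `choose_branch` is a hypothetical helper, and treating "per_channel" as "reduce every axis except the last" is an assumption made only so the example runs on its own.

```python
from __future__ import annotations

import numpy as np


def choose_branch(arr: np.ndarray, axis: str | tuple[int, ...] | None) -> str:
    """Name the branch of the diagram a call would take (illustration only)."""
    has_fast_path = arr.dtype == np.uint8 or arr.dtype == np.float32
    # Assumption: "per_channel" reduces every axis except the last (channel) axis.
    per_channel_axes = tuple(range(arr.ndim - 1))

    if axis is None or axis == "global":
        return "optimized global" if has_fast_path else "numpy fallback"
    if axis == "per_channel" or axis == per_channel_axes:
        return "optimized per-channel" if has_fast_path else "numpy fallback"
    # Any other explicit axes go through the NumPy path regardless of dtype.
    return "numpy fallback"


uint8_image = np.zeros((4, 4, 3), dtype=np.uint8)
int64_array = np.zeros((4, 4, 3), dtype=np.int64)

print(choose_branch(uint8_image, None))           # optimized global
print(choose_branch(uint8_image, "per_channel"))  # optimized per-channel
print(choose_branch(int64_array, "per_channel"))  # numpy fallback
print(choose_branch(uint8_image, (0,)))           # numpy fallback
```

The point of the sketch is that, after this change, no branch ends in an error: every dtype without an optimized path lands on the NumPy fallback.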

File-Level Changes

| Change | Details | Files |
| --- | --- | --- |
| Generalize reduce_sum’s NumPy fallback to support arbitrary NumPy dtypes with appropriate accumulator selection. | Change _reduce_sum_numpy to accept a generic np.ndarray instead of ImageType. Select accumulator dtype based on arr.dtype: uint8 → uint64, any floating dtype → float64, other dtypes (integers/bool) → int64. Delegate all non-optimized reduce_sum paths to _reduce_sum_numpy instead of raising ValueError for unsupported dtypes. Adjust control flow in reduce_sum to use elif for the per-channel axes branch so non-global axes automatically use the NumPy path. | `albucore/stats.py` |
| Update stats tests to distinguish mean/std dtype constraints from reduce_sum’s expanded dtype support. | Rename test_unsupported_dtype_raises to test_unsupported_dtype_raises_mean_std and keep it focused on mean/std/mean_std raising on unsupported dtypes. Add test_reduce_sum_accepts_any_dtype parametrized over int32, int64, float64, and bool dtypes, asserting correct global and per-channel sums and shapes. | `tests/test_stats.py` |
| Bump library version to 0.1.5 and sync lockfile. | Update project version from 0.1.4 to 0.1.5 in pyproject.toml. Update uv.lock to reflect the new version and any dependency metadata changes. | `pyproject.toml` `uv.lock` |

sourcery-ai

Hey - I've found 2 issues, and left some high level feedback:

In _reduce_sum_numpy, the return type annotation (np.uint64 | np.float64 | np.ndarray) no longer quite matches the behavior now that integer/bool paths can return np.int64 scalars; consider tightening the hint (e.g., to np.generic | np.ndarray) or explicitly including np.int64.
The accumulator selection in _reduce_sum_numpy treats all non-uint8 integer and bool dtypes as int64; if you expect larger unsigned types (e.g., uint32/uint64), consider branching on signedness and itemsize to avoid surprising overflow or sign changes.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- In `_reduce_sum_numpy`, the return type annotation (`np.uint64 | np.float64 | np.ndarray`) no longer quite matches the behavior now that integer/bool paths can return `np.int64` scalars; consider tightening the hint (e.g., to `np.generic | np.ndarray`) or explicitly including `np.int64`.
- The accumulator selection in `_reduce_sum_numpy` treats all non-`uint8` integer and bool dtypes as `int64`; if you expect larger unsigned types (e.g., `uint32`/`uint64`), consider branching on signedness and itemsize to avoid surprising overflow or sign changes.

## Individual Comments

### Comment 1
<location path="tests/test_stats.py" line_range="364-371" />
<code_context>
-        reduce_sum(arr, "per_channel")
+
+
+@pytest.mark.parametrize("dtype", [np.int32, np.int64, np.float64, np.bool_])
+def test_reduce_sum_accepts_any_dtype(dtype: type) -> None:
+    arr = np.ones((4, 4, 3), dtype=dtype)
+    result = reduce_sum(arr)
+    assert result == arr.size
+    per_ch = reduce_sum(arr, "per_channel")
+    assert per_ch.shape == (3,)
+    assert np.all(per_ch == 16)


</code_context>
<issue_to_address>
**suggestion (testing):** Extend `test_reduce_sum_accepts_any_dtype` to actually exercise the accumulator behavior (overflow/precision) rather than just trivial ones.

This test only checks that `reduce_sum` works for various dtypes and preserves shape/value semantics. It doesn’t verify the new accumulator behavior (`float64` for floats, `int64` for ints/bools, `uint64` for `uint8`). With all-ones inputs, you won’t see overflow or precision issues. Please add a parametrized test (or subtests here) with inputs near `int32`/`int64` limits and a float case sensitive to precision, then assert that the result and/or result dtype matches `np.sum(arr, dtype=<expected_acc>)`. That will lock in the chosen accumulator dtypes and catch regressions.
</issue_to_address>

### Comment 2
<location path="tests/test_stats.py" line_range="353-354" />
<code_context>


 @pytest.mark.parametrize("dtype", [np.int32, np.float64])
-def test_unsupported_dtype_raises(dtype: type) -> None:
+def test_unsupported_dtype_raises_mean_std(dtype: type) -> None:
     arr = np.ones((2, 2, 1), dtype=dtype)
     with pytest.raises(ValueError, match="Unsupported dtype"):
</code_context>
<issue_to_address>
**suggestion (testing):** Broaden `test_unsupported_dtype_raises_mean_std` to cover additional unsupported dtypes (e.g. bool, complex) to better document mean/std constraints.

The new name is clearer, but the test still only covers `np.int32` and `np.float64`. Since `mean`/`std`/`mean_std` should reject all unsupported dtypes, please add at least one non-integer/non-float case (e.g. `np.bool_` and/or `np.complex64`) to assert that these also raise `ValueError`. This will better document the supported dtypes and guard against unintentionally accepting new dtypes without updating tests.

```suggestion
@pytest.mark.parametrize("dtype", [np.int32, np.float64, np.bool_, np.complex64])
def test_unsupported_dtype_raises_mean_std(dtype: type) -> None:
```
</issue_to_address>

sourcery-ai · 2026-04-07T05:11:43Z

tests/test_stats.py

 @pytest.mark.parametrize("dtype", [np.int32, np.float64])
-def test_unsupported_dtype_raises(dtype: type) -> None:


suggestion (testing): Broaden test_unsupported_dtype_raises_mean_std to cover additional unsupported dtypes (e.g. bool, complex) to better document mean/std constraints.

The new name is clearer, but the test still only covers np.int32 and np.float64. Since mean/std/mean_std should reject all unsupported dtypes, please add at least one non-integer/non-float case (e.g. np.bool_ and/or np.complex64) to assert that these also raise ValueError. This will better document the supported dtypes and guard against unintentionally accepting new dtypes without updating tests.

Suggested change

-@pytest.mark.parametrize("dtype", [np.int32, np.float64])
-def test_unsupported_dtype_raises(dtype: type) -> None:
+@pytest.mark.parametrize("dtype", [np.int32, np.float64, np.bool_, np.complex64])
+def test_unsupported_dtype_raises_mean_std(dtype: type) -> None:

Copilot

Pull request overview

This PR bumps albucore to v0.1.5 and updates reduce_sum to support a broader set of NumPy dtypes by falling back to numpy.sum with dtype-dependent accumulator selection, alongside updated tests.

Changes:

Bump package version from 0.1.4 → 0.1.5.
Extend reduce_sum to accept non-uint8/float32 arrays via NumPy fallback with accumulator selection.
Update/rename the unsupported-dtype test for mean/std and add a new test validating reduce_sum behavior on additional dtypes.

Reviewed changes

Copilot reviewed 3 out of 4 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| `albucore/stats.py` | Allows `reduce_sum` to fall back to NumPy for non-optimized dtypes and updates documentation. |
| `tests/test_stats.py` | Splits unsupported-dtype test coverage and adds coverage for broader `reduce_sum` dtype support. |
| `pyproject.toml` | Bumps project version to `0.1.5`. |
| `uv.lock` | Updates locked package version to `0.1.5`. |

Copilot · 2026-04-07T05:13:06Z

albucore/stats.py

+    if arr.dtype == np.uint8:
+        acc: type = np.uint64
+    elif np.issubdtype(arr.dtype, np.floating):
+        acc = np.float64


_reduce_sum_numpy selects np.int64 as the accumulator for all non-uint8, non-floating dtypes. This includes unsigned integer arrays (e.g. uint16/uint32/uint64), where casting to int64 can produce incorrect negative results for values > 2**63-1 and is generally inconsistent with unsigned semantics. Consider using a np.uint64 accumulator for np.unsignedinteger dtypes (and keep np.int64 for signed ints/bool), so sums remain correct across all integer dtypes.

Suggested change

-        acc = np.float64
+        acc = np.float64
+    elif np.issubdtype(arr.dtype, np.unsignedinteger):
+        acc = np.uint64
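
A minimal sketch of the accumulator choice this suggestion leads to, assuming the branch order floating, then unsigned, then everything else. `pick_accumulator` is a hypothetical name used for illustration, not a function in albucore; `np.issubdtype` and `np.sum` are the only library calls involved.

```python
import numpy as np


def pick_accumulator(dtype: np.dtype) -> type:
    """Hypothetical helper: choose a wide accumulator that keeps the input's kind."""
    if np.issubdtype(dtype, np.floating):
        return np.float64
    if np.issubdtype(dtype, np.unsignedinteger):
        # Covers uint8 as well as uint16/uint32/uint64.
        return np.uint64
    # Signed integers and bool. Complex dtypes are not handled in this sketch.
    return np.int64


# 2**63 + (2**63 - 1) == 2**64 - 1, which fits in uint64 but not in int64.
big = np.array([2**63, 2**63 - 1], dtype=np.uint64)
total = np.sum(big, dtype=pick_accumulator(big.dtype))

print(total.dtype, int(total) == 2**64 - 1)  # uint64 True
```

With an unsigned accumulator the sum above stays exact, which is the case the review comment is concerned about.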

Copilot · 2026-04-07T05:13:06Z

albucore/stats.py

    Args:
-        arr: ``uint8`` or ``float32`` array with explicit channel dimension.
+        arr: Array with explicit channel dimension. Optimised paths for ``uint8`` and ``float32``;
+            other dtypes fall back to NumPy (float → float64 accumulator, integer/bool → int64).
        axis: ``None`` / ``"global"`` → one scalar; ``"per_channel"`` → shape ``(C,)``;


The docstring now states that reduce_sum accepts dtypes beyond uint8/float32 via a NumPy fallback, but the function signature still types arr as ImageType (uint8|float32). This mismatch will cause static type checkers to reject valid runtime usage (e.g. int64/bool). Consider widening arr’s type annotation (e.g. np.ndarray / NDArray[Any]) or adding a broader alias for stats inputs so the public API matches the documented behavior.

Copilot · 2026-04-07T05:13:06Z

albucore/stats.py

 def _reduce_sum_numpy(
-    arr: ImageType,
+    arr: np.ndarray,
    axes: tuple[int, ...] | None,
    *,
    keepdims: bool,
 ) -> np.uint64 | np.float64 | np.ndarray:
-    acc = np.uint64 if arr.dtype == np.uint8 else np.float64
+    if arr.dtype == np.uint8:
+        acc: type = np.uint64
+    elif np.issubdtype(arr.dtype, np.floating):
+        acc = np.float64
+    else:
+        acc = np.int64
    return np.sum(arr, axis=axes, dtype=acc, keepdims=keepdims)


_reduce_sum_numpy can now return np.int64 (for integer/bool inputs when axes=None), but its return annotation is still np.uint64 | np.float64 | np.ndarray. Please update the return type to include the int64 case (or use a broader scalar type like np.integer | np.floating) so type hints match runtime behavior.
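
To illustrate why a broader scalar type fits, the sketch below wraps `np.sum` in a hypothetical function (`sum_with_accumulator`, not part of albucore) annotated with `np.generic | np.ndarray`. It assumes only standard NumPy behavior: a full reduction returns a NumPy scalar, while a partial reduction returns an array.

```python
from __future__ import annotations

import numpy as np


def sum_with_accumulator(
    arr: np.ndarray,
    axes: tuple[int, ...] | None,
    *,
    keepdims: bool,
) -> np.generic | np.ndarray:
    """Illustrative stand-in: sum with an int64 accumulator."""
    return np.sum(arr, axis=axes, dtype=np.int64, keepdims=keepdims)


ones = np.ones((2, 2, 3), dtype=np.int32)

full = sum_with_accumulator(ones, None, keepdims=False)       # NumPy scalar
partial = sum_with_accumulator(ones, (0, 1), keepdims=False)  # array of shape (3,)

print(isinstance(full, np.generic), isinstance(full, np.int64))  # True True
print(isinstance(partial, np.ndarray), partial.shape)            # True (3,)
```

`np.generic` is the common base class of all NumPy scalar types, so it covers `np.int64`, `np.uint64`, and `np.float64` results without listing each one.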

- _reduce_sum_numpy: branch on np.issubdtype(unsigned) → uint64 so uint32/uint64 inputs don't silently sign-extend into int64
- return type changed from uint64|float64|ndarray to generic|ndarray since int64 and uint64 scalars are now both valid outputs
- test_unsupported_dtype_raises_mean_std: add bool_ and complex64 to the parametrize list per review suggestion
- add test_reduce_sum_accumulator_dtype: locks in expected accumulator dtype for int32, int64, uint32, uint64, float64, bool_
- add test_reduce_sum_overflow_and_precision: exercises near-max int32, near-max uint32 (sign safety), and float64 precision cases

Made-with: Cursor
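
The tests named in this commit are not shown on the page. The following is only a sketch of the kind of check they describe, using plain `np.sum` with an explicit `dtype` as a stand-in for `reduce_sum` (an assumption, so the example runs without albucore installed). The input values are chosen for illustration.

```python
import numpy as np

# Two int32 values whose true sum (2**32 - 2) does not fit in int32.
near_max = np.full(2, np.iinfo(np.int32).max, dtype=np.int32)

wrapped = np.sum(near_max, dtype=np.int32)  # int32 accumulator wraps around
widened = np.sum(near_max, dtype=np.int64)  # int64 accumulator stays exact

print(int(wrapped), int(widened))  # -2 4294967294

# (input array, accumulator dtype the commit message describes for that kind)
cases = [
    (near_max, np.int64),
    (np.full(2, np.iinfo(np.uint32).max, dtype=np.uint32), np.uint64),
    (np.array([60000, 60000], dtype=np.float16), np.float64),
    (np.ones(5, dtype=np.bool_), np.int64),
]

for arr, expected_acc in cases:
    result = np.sum(arr, dtype=expected_acc)
    # The result carries the accumulator dtype, not the input dtype.
    assert result.dtype == expected_acc
    # Exact Python arithmetic serves as the reference value.
    assert result == sum(arr.tolist())
```

All-ones inputs would pass with any accumulator, which is why the cases above sit near the limits of the input dtype.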

Copilot AI review requested due to automatic review settings April 7, 2026 05:10

Copilot started reviewing on behalf of ternaus April 7, 2026 05:10

sourcery-ai bot reviewed Apr 7, 2026

Copilot AI reviewed Apr 7, 2026

ternaus added 2 commits April 7, 2026 14:13

fix(stats): widen reduce_sum arr param from ImageType to np.ndarray

e5fce5b

The function accepts any dtype via the numpy fallback path; keeping ImageType (uint8 | float32) caused static type checkers to reject valid runtime calls with int64, bool_, etc. Made-with: Cursor

ternaus merged commit 5ffe876 into main Apr 7, 2026
6 checks passed

ternaus deleted the release/0.1.5 branch April 7, 2026 05:17

ternaus merged 3 commits into main from release/0.1.5
